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Abstract —New methods that exploit sparse structures arising 
in smart grid networks are proposed for the state estimation 
problem when data injection attacks are present. First, con¬ 
struction strategies for unohservable sparse data injection attacks 
on power grids are proposed for an attacker with access to all 
network information and nodes. Specifically, novel formulations 
for the optimization problem that provide a flexible design of the 
trade-off between performance and false alarm are proposed. 
In addition, the centralized case is extended to a distributed 
framework for both the estimation and attack problems. Different 
distributed scenarios are proposed depending on assumptions 
that lead to the spreading of the resources, network nodes and 
players. Consequently, for each of the presented frameworks a 
corresponding optimization problem is Introduced jointly with an 
algorithm to solve it. The validity of the presented procedures 
in real settings is studied through extensive simulations in the 
IEEE test systems. 

Index Terms —Smart grid security, false data Injection, dis¬ 
tributed optimization, sparse models, attack detection. 


I. Introduction 

Power networks are complex systems consisting of gen¬ 
erators and loads that are connected by transmission and 
distribution lines m. These systems can be modeled by 
complex networks, in which the generators and loads are 
represented by physically distributed nodes and power lines 
are represented by edges that connect the nodes. Because of 
the geographic and physical distribution of the nodes and the 
power transmission constraints El, various structural proper¬ 
ties of complex networks are observed in power networks El- 
For instance, the distribution of electrical distances of Eastern, 
Western and Texas interconnects in the North American power 
network obeys a power-law distribution, which leads to scale 
free and hierarchical network structures 0- 

The aforementioned structural properties of power networks 
constrain the way in which both attack and defense schemes 
are designed for the smart grid. Several attack vector con¬ 
struction and detection methods have been introduced using 
either centralized 01,11,0,1161 or distributed 0,0 models. 
Data sparsity properties have been analyzed for constructing 
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unobservable sparse attack vectors by Liu et al. 0 . Kosut et al. 
0 have introduced the relationship between attack detectabil¬ 
ity and network observability using a graph-theoretic model. 
Xie et al. a proposed a distributed wide-area state vector 
estimation algorithm which is also employed for bad data 
detection 0 . However, they do not exploit the sparsity and 
instead they define the state estimation problem as a weighted 
least squares (WLS) problem. Pasqualetti et al. nni solved 
a similar WLS problem using a measurement distributed 
decomposition method for distributed state estimation and 
attack detection. Yang et al. Ell have proposed a hierarchical 
architecture to construct sparse attack vectors using combi¬ 
natorial search methods. Vukovic et al. El have analyzed 
various mitigation schemes of data integrity attacks for state 
estimation. Recent advances for attack vector construction and 
state vector estimation methods in power systems have been 
reviewed in El and El- 

The centralized attack schemes proposed in this paper 
follow the undetectability criteria given in 0 . Lirst, sparse 
targeted false data injection attacks are introduced which 
provide a strategy for tampering with the measurements from 
meters in order to build a specific data injection vector. In 
the second proposed method, called strategic sparse attacks, 
the sparse attack vector is constructed by assuming that the 
attacker has control over only a set of measurements and that 
the system has secure measurements that cannot be considered 
in the construction of the attack vector. 

Since power grid networks are large scale networks, system 
monitoring and security control as envisioned for the smart 
grid are challenging problems. Therefore, decentralized op¬ 
tions in which the computational complexity is distributed 
throughout the network are desirable. Lor this reason, dis¬ 
tributed estimation techniques arise as strong candidates to 
incorporate adaptability to dynamic network topologies and 
flexible reconfiguration in case of sub-network faults. Addi¬ 
tionally, distributed estimation techniques do not require all 
network state information to be available to each group, which 
facilitates operating with limited knowledge about the state of 
the network. However, the distributed structure of the networks 
may lead to critical attacks. Lor instance, distributed and 
collective attacks to active nodes, which have higher numbers 
of connections than the rest of the nodes, may cause larger 
damage to the network (i.e., the group of nodes connected 
to active nodes), because of its scale-free and hierarchical 
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Structure 0, Ida. 

We introduce two distributed attack models that make use of 
the sparsity of the attack vectors. The first model. Distributed 
Sparse Attacks, assumes that the attacks are directed at clusters 
of measurements. In this setting, attackers have access to a 
subset of the measurements observed by the nodes in the 
cluster. The goal is to achieve a consensus on the design 
of the attack vectors by iteratively computing them for the 
measurements observed in each cluster. The second model. 
Collective Sparse Attacks, assumes that the network topology 
is known by the attackers and may access the measurements 
observed in the whole network. However, attacks occur in 
groups, i.e., state variables in the same group are attacked 
by the same attack vectors. 

In addition, we introduce two distributed state vector estima¬ 
tion methods from the perspective of the network operator. The 
first method. Distributed State Vector Estimation, considers 
the scale-free or hierarchical structure of the network, i.e., the 
observed measurements are grouped into clusters. Then, local 
state vector estimates are computed using local measurements 
by either local network operators or smart Phasor Measure¬ 
ment Units (PMUs). Using an iterative message-passing sparse 
optimization algorithm, each local operator or unit sends 
the estimated state vectors to centralized network processors, 
which update the state vector estimates. The second method, 
called Collaborative Sparse State Vector Estimation, assumes 
that different vector operators estimate a subset of state 
variables. For instance, different network operators may have 
expertise or special tools in order to estimate specific state 
variables and as a result, state vector variables are assumed 
to be distributed in groups and locally accessed by network 
operators. In this method, each network operator computes an 
estimate of the subset of the state variables using local data, 
and these estimates are then sent to a centralized operator in 
order to update their values. We analyze the proposed state 
vector estimation methods for attack detection and identifica¬ 
tion using a residuals test method in Section VII. 

All the optimization problems presented in this paper are 
solved using the Alternating Direction Method of Multipliers 
(ADMM) algorithm ifT^ . Parameter and stopping criteria 
selection methods of ADMM are given in m and ini. 
Moreover, convergence properties of ADMM are analyzed in 
HD and mi. 

In the next section, we review the unobservable false data 
injection and state vector estimation problems. Section III 
describes centralized sparse attack methods in which the sparse 
structure of the problem is exploited. In Section IV, we 
introduce distributed and collaborative state vector estimation 
methods. We introduce distributed and collective sparse attack 
models in Section V, and their computational complexity is 
analyzed in Section VI. We assess the validity of the proposed 
methods using real-world power systems in Section VII. The 
paper concludes with Section VIII. 

II. Problem Formulation 
A. System Model 

A review of the problem formulation of false data injection 
attacks and the state vector estimation problem for attacked 


systems follows. Consider the DC power flow state acquisition 
problem HI given by 

z = Hx + n, (1) 

where z e is the vector of measurements, x e is the 
state vector which consists of the voltage phase angles at the 
buses, H 6 is the measurement Jacobian matrix and 

n € R^ is the measurement noise. 

The goal of the network operator is to estimate the state 
vector and decide whether an attack is present. If the noise is 
normally distributed with zero mean and independent compo¬ 
nents, then the following estimator can be employed 11: 

x= (H^AHj-^H^Az , (2) 

where A is a diagonal matrix whose diagonal elements are 
given by An = and is the variance of the i-th 

measurement for i = The network operator decides 

that an attack is present if ||z - Hx||| > r, where 11-112 is the 
^ 2 -norm and t € R is a given threshold. If ||z-Hx||| < t then 
no attack is declared. 

The goal of the attacker is to inject a false data vector 
a e R^ into the measurements without being detected by the 
operator. Since the attack is performed by changing the values 
of a subset of all the measurements, the resulting observation 
model for the operator is 

z = Hx + a + n. (3) 

Note that for the attack vector Oi + Q, Vi e A, where A is the 
set of measurement variable indices with which the attacker 
tampers. On the other hand, the measurements over which the 
attacker has no control are the secure variables, = 0, Vi e 5. 
Note that S = A where (•) is the set complement operator and 
IA u iS| = A where | ■ | denotes set cardinality. 

Imposing the constraint a = He, where c € R^ is an injected 
data vector guarantees undetectability via residual tests since 
it lies in the column space of H HI, JS]. Note that 0 can be 
rewritten as 

z = Hx + n, (4) 

where x = x + c is what an operator unaware of the attack 
tries to estimate instead of the actual state vector x. 

B. Sparsity in the System 

Assuming that an attacker can tamper with a limited number 
of meters poses the optimization problem in a framework in 
which the attack vector is sparse. Specifically, if k meters 
are controlled by the attacker, then a is at most ^-sparse, 
i.e., ||a||o < k, where || • ||o is the Iq norm. In fT] Liu 
et al. prove the existence of unobservable attack vectors if 
k > N - D + 1. Finding the sparsest attack vector that satisfies 
a = He is computationally intractable in general. Surprisingly, 
the solution can be relaxed into a convex optimization problem 
by using the £i-norm as the objective function instead of the 
£o-norm 1191,11201. Based on sparse reconstruction techniques, 
Kim and Poor ||4| provide a greedy approach for the attack 
vector construction when a subset of the measurements is 
controlled by the attacker while the remaining measurements 
are secured. 
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A second scenario in which the sparsity of the system can be 
exploited is in the estimation of the state vector. Considering 
that system states are given by a random process {x(}, the 
components of the state vector that change significantly during 
an interval (f, t') are defined as 

Xt,t'= (5) 

where e defines the threshold for change significance. That 
being the case, the operator does not need to estimate all state 
variables for each time t. Assuming that it has a previous 
state estimate, xj, from time t, it can estimate the values that 
changed above the e threshold by realizing that 

yt' -yt = Hxt- - Hxt + Zf - zt (6) 

= H(xt'- xt) + zt'- zt (7) 

= + Zt'-Zt, (8) 

where <5* = Xt'-Xt has significantly changed variables given 

by indices Xt^f. By choosing the significance threshold, e, and 
the estimation time interval appropriately, 6 becomes a nearly 
sparse vector whose k largest components can be recovered by 
solving a standard compressed sensing problem of the form 

minimize ||5||i 

subject to II - y* - H5||| < 7, 

where 7 is a regularization parameter. 

An additional optimization constraint is imposed by the rank 
deficiency observed in the measurement Jacobian matrix H of 
several IEEE test systems, such as the IEEE 39-Bus IItTI . In 
Table H] we show the values of N, D, rank and R (the ratio 
of the number of nonzero elements of the entries of H) for 
test systems. We observe that 9-Bus, 14-Bus, 30-Bus and 39- 
Bus test systems are rank deficient. Although H matrices of 
57-Bus, 118-Bus, 300-Bus and 3375-Bus test systems are not 
rank deficient, their R values are greater than those of the 
rank-deficient matrices. Note that the sparseness increases as 
the system size increases. Eollowing the sparse nature of the 
system, © and dUi are formulated as ^i-norm optimization 
problems 0 . 

TABLE I; Rank values of measurement Jacobian matrices of 
IEEE test systems and the 3375-Bus Polish system plus - 
winter 2007-08 evening peak system. 


System 

N 

D 

Rank 

R 

9-Bus 

19 

9 

8 

72.00 % 

14-Bus 

34 

14 

3 

80.25 % 

30-Bus 

71 

30 

29 

90.89 % 

39-Bus 

85 

39 

38 

93.27 % 

57-Bus 

137 

57 

57 

95.22 % 

118-Bus 

304 

118 

118 

97.64 % 

300-Bus 

711 

300 

300 

99.09 % 

3375-Bus 

7536 

3375 

3375 

99.92 % 


III. Centralized Data Sparse Attacks 
A. Sparse Targeted False Data Injection Attacks 

Targeted Ealse Data Injection Attacks consist of attackers 
constructing false data injection vectors a corresponding to 
a given attack vector c. In this section, we introduce two 
models that employ LASSO and regressor selection algorithms 
to solve the targeted false data injection problem. 


1) Targeted LASSO Attacks: The sparseness of c is ex¬ 
ploited for targeted false data injection attacks I©, where 
Cj 6 R are fixed and defined by attackers Vj € I, for a 
set I of indices of the state vector variables that will be 
attacked. However, cj 6 R are randomly selected by the 
attacker according to a probability distribution Vj e I, where 
I is the set of off-target variables which are not specifically 
determined by the attackers. In other words, the attackers do 
not have control on the variables Cj e 1. Note that, \XuX\ = D. 

In order to compute the off-target and targeted attack 
vectors, we employ the following decomposition ©: 

a=Hc= (10) 

i€l FI 

where h/ is the Z-th column of H. Then, we define a sub¬ 
matrix of H as 

Jd-|x|)’ ^ ^ ^ - 

i < D -\X\ © and construct a vector b in the range space of 

the attacked measurements, such that b = Using 

this construction, we relate b to the measurements H^, such 
that -1 and y = B^b 

©. Therefore, we can compute a by solving y = B^a ©. 

We assume that given an attack vector a, the attack strategy 
of an attacker is to find a sparse a, such that y = B^a. 
Then, using li relaxations for sparse vector estimation CD, 
II 20 I . II 22 I . we introduce the following optimization problem 
to model the sparse false data injection attack: 

minimize ||a||i 

II Hi 

subject to y = B^a. 

(HUl is called basis pursuit and can be employed to find a 
sparse solution vector c im, Eo), Ea. In order to solve 
the optimization problem above using ADMM OSl, (EB is 
formulated as 


minimize /(a)+||/3||i 

subject to a-/3 = 0, ^ 

where /(a) is the indicator function for {a 6 R^ : y = B^a} 
and /3 6 R^ is the optimization variable. The sparsity of a is 
governed by ||/3||i using a scalar real number A > 0, which 
is a regularization parameter. Moreover, in order to reduce 
the probability of the attack being detected, ||y- B^a||| can 
be used as a cost function which results in the optimization 
problem 


minimize ||y- B^ajj^ + A||/3||i 
subject to a - /3 = 0. 

Problem (fT3l l is a LASSO optimization E 2 I and can be solved 
via ADMM DD as follows: 

Algorithm 1 (LASSO via ADMM): 

. INPUT: 

- Projection matrix defined by secure set B^ 

- Projected vector containing injected data y 

- Penalty parameter p 

- Maximum number of iterations t' 

. OUTPUT: 

- Attack vector candidate a a* 
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. PROCEDURE: 

1) Initialize t = 0, /3 = 0 and u = 0 

2) Compute ridge regression with penalty parameter p: 

a*.! ^ ((B^)^B^ + p/)’'((B^)^y + p(/3‘-u‘)) 

(14) 

3) Perform soft thresholding defined by proximity op¬ 
erator n„(0) = {f- k)+ - k)+, where ((/>)+ = 

max((^, 0): 

+ (15) 

4) Update: 

= u* 4-(16) 

5) Return to step 2 if a stopping criterion is not satisfied 
and t <t' 

2) Selective Targeted Attacks: The previous approach pro¬ 
vides an implicit control of the sparsity of a using parameter 

A. In the following, the sparsity is explicitly controlled by 
introducing the constraint ||a||o < k in (fTTT l in the optimization 
problem as 

minimize ||y-B^a ||2 
subject to ||a||o < /c . 

This optimization problem can also be solved via ADMM 
with a minor modification of Algorithm [T] Specifically, hard- 
thresholding 4- u‘) is employed in the update of /3, 

such that k largest magnitude elements of u* 4- a*’^^ are kept 
and zeros are assigned to the remaining elements M- 


and (Ti e formed by removing the i-th variable of 

c. Eollowing these definitions, the strategic sparse attack is 
defined as 


minimize cTi + h.-f II 1 

subject to Hf CTi 4- hf = 0 . 


(19) 


Since H and c are sparse, it follows that the problem can 
be reformulated as 

minimize ||cri||i 

subject to Hf CTi 4 - hf = 0 . 

Since (l20l l is a LASSO problem, we reformulate (l20l l as an 
ADMM optimization problem as follows: 

minimize ||Hfcri 4- hf ||| 4-A||ei||i 
subject to (Ti - ©i = 0 , 


where 6i € ^ is the optimization variable. In order to solve 

(ED, Algorithm [D can be employed with inputs Hf , y = 1 
and 9i. This procedure is repeated for i = 1,Z? in order to 
compute c = (ci,c^). 

2) Selective Strategic Sparse Attacks: As discussed in the 
previous section, the sparsity of can be bounded explic¬ 
itly by converting (l 20 l i to the equivalent regressor selection 
problem given by 

minimize ||Hf^, + hf||i 
subject to ||cri||o<fc. 


B. Strategic Sparse Attacks 

In this section, we propose two algorithms to compute the 
attack vector c based on the formulations of LASSO Attacks 
and Selective Sparse Attacks for the strategic sparse attack 
model case introduced by Kim and Poor a. To this end, 
we first redefine the sparse data injection attack problem for 
ADMM. Then, we solve the optimization problems using 
LASSO and Regressor Selection algorithms. 

1) Strategic Sparse Attacks with LASSO: In Strategic 
Sparse Attacks, a row-wise decomposition of the Jacobian 
measurement matrix is employed based on the set A of 
attacked measurement indices denoting the meters to which 
an attacker has access, and the set S of secure measure¬ 
ment indices, i.e., the indices of meters which cannot be 
tampered by an attacker. Specifically, a sub-matrix H*® = 
Vjj 6 iS, of H is constructed in order to 
represent the secure measurements, where is the ji-th 

row of H, such that H‘^c = 0. Similarly, sub-matrix H"^ is 
defined for attacked measurements. As a result, the attacker’s 
strategy is defined to find a solution c to the following 
optimization problem: 

minimize ||H'^c||o 

subject to H'^c = 0 , (18) 

l|c||oo > , 

where -0 > 0 is a given constant I?). 

Define as the i-th column vector of H, the sub-matrix 
Hi 6 formed by removing the Uth column of H, 


In this formalism, we relax the constraint in H'^c = 0 and 
introduce a sparsity constraint in the construction of attack 
vectors c, such that we compute an attack vector with at most k 
non-zero elements. The solution to (l22T i is the same as the one 
proposed for (l 2 TT i except for substituting the soft thresholding 
operator in step 3 by a hard thresholding. 


C. Computational Complexity of Centralized Sparse Attacks 

The optimization problems of the centralized sparse attacks 
are solved using Algorithm 1. The computational complexity 
of the algorithm is dominated by the attack vector update 
step in (Ell which solves a ridge regression problem ESj . 
Therefore, the computational complexity of the algorithm is 
Tie where 

1) a = min(7V, |I|) for targeted attacks given in Section 
III.A, and 

2) a = min( D-1), for strategic attacks given in Section 
III.B. 

Note that, the computational complexity of the algorithm is 
increased by an additional term D (the dimension of the attack 
vector) to 0(t'Da^) for strategic attacks, since the algorithm 
is implemented D times. 

In the implementation, the running or iteration time t' can 
be relaxed by using performance based early stopping criteria 
as suggested in m- 
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IV. Distributed and Collaborative Sparse State 
Vector Estimation 

A sparse state vector estimation model, called Distributed 
Sparse State Vector Estimation, is first introduced in order 
to estimate the state vectors under attack on the network 
measurements using an instance distributed LASSO algorithm. 
In the second model, called Collaborative Sparse State Vector 
Estimation, we assume that the topological information of the 
network and the measurements are available to the network 
operator. However, the network operator can choose to process 
different groups of state vectors using the Group LASSO 
algorithm. We solve the optimization problems using the 
ADMM algorithm. 

A. Distributed Sparse State Vector Estimation 

Measurements are distributed in the network and usually 
form clusters following the topological properties of the 
network. Additionally, observation vectors and measurement 
matrices are partitioned into G blocks denoted by Qi with 
\Gi\ = Ni for z = 1,..., G. As a result, the attacks can also be 
partitioned. Taking this into account, dill can be rewritten as 


Zi 


Hi 


ai 


ni 




X + 


+ 


ZG 


Hg 


aG 


Hg 


where Zi € is the measurement observed in the z-th cluster 
of nodes through measurement matrix Hj € and noise 

Wi € and which is under attack sn 6 R^* with z = 1,..., G. 
For each cluster, we consider the penalty function 

/, = ||z,-H,x,||^ (24) 

where is the state vector estimated at cluster i. Note that 

/e||z-hx||2 = |;/,. 

i=l 

Therefore, we can write the distributed optimization problem 
in the following way: 

G 

minimize '^fi + g{f3) (25) 

i=l 

subject to 5ti-f3 = 0, z=l,...,G, (26) 

where /3 e R^ is the optimization variable, p(/3) = A||/3||i 
is the regularization function and A e R is the regularization 
parameter, which controls the sparsity of the solution vec¬ 
tor. Since network operators accessing the local data should 
agree on the estimated state vector, we introduce a consensus 
constraint in ( |26] |. In other words, ( |2^ is considered as a 
global consensus problem in which (3 is used as the global 
optimization variable. 

We solve (l26l l using an ADMM implementation as described 
in the following algorithm. 

Algorithm 2 (Distributed Estimation via ADMM): 

. INPUT: 

- Projection matrix H 

- State measurements z 


- Set of clusters {Gi}%i 

- Penalty parameter p 

- Maximum number of iterations t' 

. OUTPUT: 

- Estimated state vector x e x* 

. PROCEDURE: 

1) Initialize f = 0, /3° = 0, u° = 0. 

2) For z = 1,..., G compute the Tikhonov-regularized 
least squares estimate with penalty parameter p 
given by 

x^ = (HfH, + pi)-' (Hfz, + p(/3‘ - u^). 

(27) 

3) Perform a soft thresholding given by 

= + (28) 

where the G proximity operator is defined as 

= {f-i^)+-{-f- k)+ (29) 

and ((^)+ = max((^, 0). 

4) For z = 1,..., G update 

= u* + x*+i -(30) 

5) Return to step 2 if the halting criterion is not 
satisfied and t < t'. 

Note that is a sparse matrix or vector (depending on Gi)- 
Still, (HfH, + pi) is invertible since p > 0. 

Algorithm and optimization variables are initialized in the 
first step of the algorithm. In the second step, each network 
operator computes a local estimate using Tikhonov-regularized 
least squares ll24ll . Il25]l . Then the local estimates are gathered 
to update the global variable (3 in the third step. Finally, the 
updated (3 is distributed or broadcast to the clusters to update 
the dual variables u^, Vz = 1,...,G, in the fourth step, and 
the halting criterion is checked in the last step. 


B. Collaborative Sparse State Vector Estimation 

In the distributed sparse attacks scenario, measurements 
are assumed to be distributed across clusters and operators 
have access only to local measurements. Alternatively, when 
collective sparse attacks are considered, operators know the 
whole topology of the network and the Jacobian measurement 
matrix H. However, in a distributed framework operators 
observe a subset of state vector variables, i.e., each operator 
may observe different groups of buses. 

In this setting, the observation model Q can be rewritten 
as 


z=[Hi-Hg] 


xi 


XG 


+ a + n , 


(31) 


where z 6 R'^ is the measurement vector, x^ € R-®’ is the 
state vector, n € R'^ is a noise vector and Hi 6 is 

the Jacobian measurement submatrix formed by selecting the 
columns given by the indices of the subset of state variables 
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assigned to cluster i. Given this structure, the optimization 
problem can be stated as 

G 

minimize ||Hx-z||i + AX;il**l| 2 . (32) 

i=l 

By introducing an optimization variable v 6 it follows 
that 


G 


minimize 

||Hv-z||| + A^ ||x *||2 

2=1 

(33) 

subject to 

Xi - Vi = 0 , i = 1,..., G , 

(34) 


is equivalent to ( |32] |. where is the estimate of v for X, M- 
In order to solve (l34l i. the proposed ADMM implementation 
is described below. 

Algorithm 3 (Collaborative Estimation via ADMM): 

. INPUT; 

- Projection matrix H 

- State measurements z 

- Set of clusters {Gi}^i 

- Penalty parameter p 

- Maximum number of iterations t' 

. OUTPUT: 

- Estimated state vector x e x* 

. PROCEDURE: 

1) Initialize f = 0, /3° = 0, v° = 0, 0° = 0 and x° = 0. 

2) For i= 1,..., G compute 

= argmin(p|| 0-||2 + A||x*|| 2 ), (35) 

where 0| = (x^ - x-) - v* + Hx* + u‘ and 

3) Update 

= 77 —(z + pHx‘^\pu*), (36) 

G + p 

= u‘ + . (37) 

4) Return to step 2 if the halting criterion is not 
satisfied and t < t' . 

V. Distributed and Collective Sparse Attacks 

In Section III, we introduced centralized sparse attack 
methods. In this section, two distributed attack models are 
proposed in order to employ sparse attacks in a distributed 
framework. For this purpose, the structure of the measurements 
and the attack vectors is redefined, followed by a formulation 
of the false data injection problem as a distributed sparse 
optimization problem. 

The proposed distributed attack models are motivated by 
two distributed attack scenarios. 

1) In Distributed Sparse Attacks, measurements are as¬ 
sumed to be distributed in the network and may form 
clusters following the topological properties of the net¬ 
work. Therefore, different attackers located in different 
clusters can construct attack vectors by just analyzing 
the local measurements observed in the clusters. 


2) Collective Sparse Attacks model assumes that attackers 
know the whole topology of the network and the Jaco¬ 
bian measurement matrix H. However, in this case the 
attacks are directed at a group of state vector variables 
distributed in the network, i.e., each attack injects false 
data into the state vector variables of the corresponding 
cluster. 

Although linear sparse attacks are considered for the imple¬ 
mentation of distributed attacks in this work, the proposed par¬ 
allelization and distributed processing strategies can be used 
as design patterns for developing distributed sparse targeted 
false data injection attacks and strategic sparse attacks. 


A. Distributed Sparse Attacks 

A linear sparse attack model is considered, in which given 
an attack vector, a, the attack strategy is to find a sparse 
injection vector, c, such that a = He HI. Using an £i 
relaxation for sparse vector estimation IT^ . Il 20 l . Il 22 l . the 
following optimization problem is considered; 

minimize ||c||i 

subject to a = He. ^ 

As noted before, measurements are assumed to be dis¬ 
tributed in the network and may form clusters following 
the topological properties of the network. Similar to the 
partitioning presented in the previous section, the Jacobian 
measurement matrix is partitioned into G number of subma¬ 
trices, which results in a partitioning of the attack vector given 
by 


ai 


Hi 



Hg. 


Note that, (|39] | can also be employed in the distributed false 
data vector construction described in ( | 2 ^ . 

In order to solve ( |38] | in the distributed form set by (l39b 
using a distributed optimization algorithm, the loss function is 
assumed to be separable, such that 

/,= ||a,-H,c,||2. (40) 

Note that, Ei=i fi = /■ Moreover, the optimization problem 
(l38l l is assumed to be feasible M- Therefore, the distributed 
optimization problem for (l38T l can be reformulated as 

minimize T,f=i ft + (41) 

subject to Ci - <p = 0, i = 1 ,..., G , (42) 

where (p 6 is the optimization variable, g{(p) = A||^||i 
is the regularization function and A € K is the regularization 
parameter which controls the sparsity of the solution vector. 
Interestingly, the optimization problem (l42l) is the same as the 
one posed in ( |26] | and therefore. Algorithm |2] can be used to 
solve it. 
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B. Collective Sparse Attacks 


The collective sparse attacks model assumes that attackers 
know the whole topology of the network and the Jacobian 
measurement matrix H. However, in this case the attacks 
are directed at a group of state vector variables, i.e., each 
attack injects false data into the state vector variables of 
the corresponding cluster. Within this setting, (l38T l can be 
rearranged as 


a=[Hi-HG] 


Cl 

CG 


(43) 


where the injection vector 6 is computed by the i-th 
attacker using Jacobian measurement submatrix Hi for i = 
Ij-.-jG. Following this decomposition, the optimization can 
be posed as 


G 

minimize ||Hc-a||i + A^||c,|| 2 . (44) 

2=1 


Introducing the optimization variables, ijj e R.^, yields a new 
formulation 

G 

minimize IjHi/) - a||| + A ^ ||ci ||2 (45) 

i=l 

subject to Ci-xpi = 0,i=l,...,G, (46) 

where is the estimate of for c, an. In the same fashion 
as with the previous problem, the optimization problem (|46] | 
is the same as (l32l l and therefore. Algorithm |3] can be used to 
solve it. 


VI. Computational Complexity of Distributed 
Algorithms 

We solve the distributed optimization problems using two 
main approaches, namely measurement distributed and at¬ 
tribute distributed optimization as given in Algorithm 2 and 
Algorithm 3 respectively. In the measurement distributed ap¬ 
proach, we assume that the measurements are distributed and 
the local solutions of the optimization algorithms are computed 
in the clusters. In the attribute distributed approach, we assume 
that the state or attack vector variables are distributed and local 
estimates are computed in the clusters. 

If we ignore communication times required to gather and 
broadcast the locally estimated vectors and local variables 
Ui, then the computational complexity of Algorithm 2 is 
dominated by the x^-update step in (l27T i. Vi = 1,..., G. Since 
the partitioned Jacobian matrix is used in dZTl i. the compu¬ 
tational complexity of dZTl i is 0(af), where at = min(A^i,iA) 
in each cluster Qi. Then, the complexity of Algorithm 2 is 
T 2 e max^f'O(af),... since a central processor 

which employs the third step of the algorithm should wait 
to gather all the local estimates from the processors in the 
clusters. If we define the maximum communication complexity 
of gathering the local data as Tg and that of broadcasting 
as Tf,, then the complexity of Algorithm 2 is increased to 

T2 + Tg + Tfa. 

Similarly, G parallel regularized least squares problems 
are solved in Gi variables in the x^-update step (ITST i of 


Algorithm 3. Since data partitioning by attribute is employed, 
the computational complexity of (iTSl l is 0 (af), where ai = 
Similarly, the complexity of Algorithm 3 is 
T 3 e max ^t'O (af),... ,t'0 {a q)^, and the communication 
cost increases the complexity to T 3 + Tg + T;,. 

In the implementations, several practical tricks such as 
caching can be used to decrease the computational complexity 
of the local optimization algorithms dZTl i and (ITST i. For further 
details, please refer to m- 

VII. Numerical Results 

In this section, the validity of the proposed algorithms is 
assessed by numerically evaluating the performance of the 
algorithms for IEEE 9-Bus, IEEE 30-Bus, IEEE 57-Bus and 
IEEE 118-Bus test systems II 2 TI . Eor each data point 100 
realizations are simulated. Eor all simulation results, A is fixed 
as ifThl 

X = GXma., (47) 

where G is a constant, Xmax = ||Hz||oo for distributed sparse 
state vector estimation methods, and Xmax = ||Ha||oo for 
distributed attack models. In addition, Xmax can be considered 
as a critical value of the regularization parameter A above 
which the estimated state and attack vectors take zero values. 
Consequently, G determines the sparsity of the solutions of the 
optimization problems and the number of iterations required to 
obtain the solutions, i.e. the estimated state and attack vectors. 
Eor that reason, an optimal A or G is computed by analyzing 
the solution (or regularization) path of the optimization algo¬ 
rithms using a given training dataset. A detailed analysis of 
the impact of G on the number of algorithm iterations required 
to obtain an optimal solution is given in mg) for ADMM 
implementations of LASSO type algorithms. We choose the 
penalty parameter as p = 1 , the absolute tolerance as 10 “^, 
the relative tolerance as 10 "^ and set the maximum number 
of iterations t' = 10000 . 

In the experiments, it is assumed that the attacker has 
access to k measurements. Eor each realization, a fc-sparse 
attack vector, a, is randomly generated by selecting the non¬ 
zero indices following a uniform distribution and Gaussian 
distributed amplitudes with the same mean and variance values 
as z. Eor distributed instances of the problem, the number 
of clusters, G, is uniformly distributed from the set of all 
prime divisors of N. On the other hand, for the collaborative 
instances, G is chosen from the set of all prime divisors of D. 

A. Results for Centralized Data Injection Attacks 

In order to assess the performance, the following parameters 
are computed in the simulations: 

1 ) Pr{ai + 0,ai + 0 ) or simply Pr(a',a'i), which is the 
probability of correctly constructing an attack variable 
a^ + 0 of a false data injection vector a. 

2 ) Pr{ai = 0,ai = 0 ) or simply Pr{ai,ai), which is the 
probability of correctly constructing a secure variable 
ai = 0 of a false data injection vector a. 

Since Pr(a',a'i) + Pr{ai,a'i) = 1 and Pr{ai,ai) + 

Pr(a',ai) = 1 , probabilities of incorrect constructions can 
be computed from the results. 
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(a) Pr(a^,a'i) for TLA (b) Pr(ai,ai) for TLA 



(a) Pr(a',a'i) for SLA 


(b) Pr{ai,ai) for SLA 



(c) Pr{a^,a'i) for TSA (d) Pr{ai,ai) for TSA 


Fig. 1: False data vector construction probabilities for for TLA 
and TSA. 



(c) Pr(a^,a'i) for SSA (d) Pr{ai,ai) for SSA 


Fig. 2: False data vector construction probabilities for SLA 
and SSA. 


False data construction probabilities of Targeted LASSO 
Attacks (TLA), Strategic LASSO Attacks (SLA), Targeted 
Selective Attacks (TSA) and Strategic Selective Attacks (SSA) 
are compared in the following. 

In Figure [H the experiments for TLA and TSA are analyzed 
and the changes of false data vector construction probabilities 
are depicted for a varying number of attack variables, for 
each test system. The construction probabilities do not increase 
or decrease smoothly for TLA (Figures [T]a and[T]b.), since A 
is computed dynamically by (l47l l for each realization and test 
system. Therefore, sparseness is not explicitly controlled in 
LASSO Attacks. On the other hand, the dynamic computation 
of A using (l47l i enables estimation of the sparseness of the 
attacks and the randomness in H. Therefore, the false data 
vector a is constructed with similar probabilities independent 
of the test system and sparsity level ^ of the attack vectors 
in the TLA case. For instance, Pr(a',a'i) obtains values 
in the range [0.5,0.7] in Figure [T] a and Pr{ai,ai) obtains 
values in the range [0.3,0.5] in Figure [T]b. Since sparseness 
can be controlled in Selective Attacks, a smooth change of 
the construction probabilities of false data vector variables is 
observed for TSA in Figure [Tjc and Figure [Tjd. 

Note that, if the regularization parameter for LASSO is 
optimized, the solution vectors for LASSO and Regressor 
selection algorithms coincide M- In Figure |2] it can be seen 
that similar solutions are attainable, i.e. both methods construct 
similar attack vectors. For instance, we observe that the 
attacked variable construction probabilities increase similarly 
in Figures |2]a and |2]c, while secure variable construction 
probabilities decrease similarly in Figures |2]b and|2]d for SLA 
and SSA. 


B. Results for Attack Detection using Distributed Sparse State 
Vector Estimation 

In this work, our primary interest from the network oper¬ 
ator’s point of view is the distributed estimation of the state 
vectors. In this section, we analyze the proposed state vector 
estimation methods by employing them for the attack detection 
problem using a modified Normalized Residual Test (NRT) 
procedure 0 . 

In the attack detection procedure, we first estimate the state 
vectors x using the algorithms proposed in Section III. Then, 
the error of the system is computed as ||z-Hx ||2 and the resid¬ 
ual of an observed measurement i is given by \\zi - (Hx)^ |||, 
where (•)i denotes the i-th element of the argument vector. 
Following the classical detection criterion, it is declared that 
the observation i is attacked if \\zi - (Hx)i||| > r. Since 
our goal is to detect the attacks on specific measurements, 
we do not remove the attacked measurement vectors at each 
iteration of the algorithm unlike the NRT method proposed 
in 0 . In addition, such a removal process disturbs the data 
space. Therefore, the proposed estimation methods should be 
re-implemented and the regularization parameters should be 
re-estimated on the updated datasets, leading to additional 
computational costs. 

In the experiments, both algorithms operate with fixed pa¬ 
rameter C = i for the regularization parameter A. In addition, 
T is chosen as 2 ^„||/- ||oo, where 

and are the variance and the covariance matrix of the noise 
n in ([T]i respectively, as suggested in ||2^ . 

In this section, we construct the attack vectors using Ran¬ 
dom False Data Injection Attacks when the attacker has access 
to any k meters to construct fc-sparse attack vectors a, as 
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Fig. 3: Experiments for the IEEE 57-bus test system with various G values. 


suggested by Liu, Ning and Reiter m. 

Performance indices Precision (Free), Recall (Rec) and 
Accuracy (Acc) are defined as 


Free = 


tp 

tp+fp ’ 


Rec = 


tp 

tp+fn ’ 


Arr = 

tp+tn+fn+fp 


, (48) 


where true positive (tp), true negative (tn), false positive (fp), 
and false negative (fn) are defined in Table [III Eor instance, 
tp represents the number of attacked measurements that are 
correctly detected. On the other hand,^ represents the number 
of secure measurements that are wrongly declared as attacked. 
Note that. Free is equal to Fr{a.i = a^), which is the 
probability that a network operator can successfully detect k 
specific attacks for all + 0. 


TABLE II: Definitions of tp, fp, tn, and fn 


Classified as Attacked 
Classified as Secure 


Attacked Secure 


tp 

fp 

fit 

tn 


Results for different numbers of measurement clusters, G, 
are considered in Eigure [3] In this experiment, each operator 
has access to locally observed measurements, state vectors and 
submatrices. The simulated cases are G = \N\ and G = \D\, for 
distributed and collective state estimation algorithms respec¬ 
tively. Therefore, G = |A^| and G = \D\ are the extreme cases 
for distributed processing scenarios. However, the algorithms 
have similar performance for different values of G in Eigure |3 
which shows that the optimality loss with respect to centralized 
strategies is small in the simulated settings. 



(a) Distributed detection for 9-bus. (b) Collective detection for 9-bus. 



0.2 0.4 0.6 0.8 


(c) Distributed detection for 57-bus. (d) Collective detection for 57-bus. 


Eig. 4: Distributed estimation performance indices for IEEE 
9-bus and IEEE 57-bus test systems. 


Eigure |4] shows the results of distributed and collective state 
vector estimation algorithms for G = 1. Note that, the case 
G = 1 represents a centralized processing scenario in which 
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(a) Pr(a'^,a'i) of distributed attacks. (b) En'or of distributed attacks. 


the tuning of the regularization parameter. For instance, the 
injection vectors are computed and the optimization variables 
are updated locally in each group with respect to a global 
regularization parameter in distributed and collective attacks. 
In other words, the group-wise local regularization paths (i.e., 
the set of solutions) are computed and used to approximate a 
global regularization path. Since the paths of Group LASSO 
are piecewise differentiable, approximating the global path by 
the local paths may be a challenge as G increases. A solution 
to this challenge is to compute group-wise parameters in an 
adaptive scheme 1281. 
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(c) Pr(a^,a'i) of collective attacks. (d) Error of collective attacks. 

Fig. 5: Experiments for the IEEE 57-bus test system. 


all of the observed measurements and the whole Jacobian 
measurement matrix are available to network operators. It 
can be seen that the performance values of distributed and 
collective estimation methods are similar for the IEEE-9-bus 
test system in Eigure|4]a and Eigure|4]b respectively. However, 
the results in Eigures |4]c and Hid show that for low values 
of k/N the precision fluctuates and stabilizes around 0.9 as 
k/N increases for the IEEE-57-bus test system. In addition, 
the slopes of the curves representing the increases of accuracy 
and recall values are slightly smaller in Eigures |4lc and |4ld 
than the ones for the IEEE-9-bus test system case. 

C. Results for Distributed and Collective Sparse Attacks 

In order to measure the detectability of the attacks from the 
perspective of the network operators. Error = || 2 :i-(Hx)i||| is 
considered. Throughout this section, both algorithms operate 
with fixed parameter G = |. 

In Eigure|5] the results of the distributed attack experiments 
for the IEEE-57-bus test system are shown. Remarkably, the 
proposed algorithms are capable of successfully injecting data 
with high probability for a large range of sparsity ranges. 
However, it can be seen in Eigures|5]a and|5]b that Pr(a', a'^) 
decreases and Error increases as G decreases. This is due 
to the fact that the optimization is very sensitive to the 
optimization of the regularization parameter A. Interestingly, 
in the simulation settings evaluated for this paper, it has been 
observed that the proposed algorithms are more robust to varia¬ 
tions of A when smaller values of G are considered. It is known 
ll27l that the optimization of the regularization parameter in 
the centralized case is hard. Surprisingly, as the fragmentation 
of the optimization problem increases, i.e., for lower values 
of G, the performance of the algorithm is less sensitive to 


VHI. Conclusion 

In this paper, we have considered centralized and distributed 
models for sparse attack construction and state estimation 
in the smart grid. Eor a centralized scenario, two methods, 
LASSO Attacks and Selective Attacks, have been introduced 
for the construction of false data vectors and attack vectors for 
a given attack model. The presented methods are used in two 
well-known attack models. Targeted and Strategic Attacks. 

We have shown that Selective Attacks provide control of 
the sparsity of the attack vectors, explicitly. Therefore, a con¬ 
struction method has been proposed for false data and attack 
vectors, which contain a given number of attacked and secure 
variables. Incidentally, the randomness of the parameters of 
the attack models may decrease the unobservability of the 
attack vectors and the control for the construction of false data 
vectors. Eor instance, random construction of the sub-matrices 
in Targeted Attacks may inject additional randomness into the 
probabilities of constructing false data vectors a. 

Eor the case in which the distributed nature of the network 
is considered, new distributed sparse state vector estimation 
and attack detection methods have been introduced. In the 
Distributed State Vector Estimation method, it is assumed 
that the observed measurements are distributed in clusters in 
the network. The state vectors are estimated using local data 
measurements in the clusters by either local network operators 
or PMUs. The estimates are then updated by centralized 
processors. In Collaborative Sparse State Vector Estimation, 
operators estimate a subset of variables of the state vectors. 
Therefore, state vector variables are assumed to be distributed 
in groups and accessed by the network operators locally. In this 
scenario, network operators compute their local estimates and 
send the estimated values to a centralized network operator in 
order to update the estimated values. 

In the experiments, it has been observed that both state 
vector estimation methods perform similarly for a varying 
number of attacks in different test systems. Besides, accuracy 
and precision values of the proposed methods decrease as 
the system size increases and the performance values do not 
change as the number of clusters increases. In other words, 
we can achieve similar performance when we implement the 
algorithms in centralized (G = 1) and massively distributed 
scenarios {G = N or G = D). 

When the Distributed Sparse Attacks model is considered, 
it is assumed that attackers process only local measurements 
in order to achieve a consensus for attack vectors. In the 









11 


Collective Sparse Attacks case, the topological information of 
the network and the measurements is available to attackers. 
However, the attackers employ attacks on variable groups of 
state vectors. 

It has been observed in the experiments that the Collective 
Sparse Attacks model performs better than the Distributed 
Sparse Attacks model for the construction of unobservable at¬ 
tack vectors. Surprisingly, better performance of the algorithm 
with higher G values is achieved than smaller G values when 
larger systems are considered. This is due to the fact that one 
of the challenges of the proposed methods is the estimation of 
algorithm parameters, e.g., the maximum number of iterations 
and the regularization parameter A. For the case in which the 
sparsity degree, k, of the solution vectors are known a priori, 
regressor selection algorithms can be employed in order to 
control the sparsity of the solutions. 
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